Corpus: eng_newscrawl-public_2018

Other corpora

2.2.5 Most frequent word beginnings

The most frequent word beginnings as character N-grams for N=1...5 with Zipf's diagram


Zipf's diagram for word beginnings


Gnuplot diagram

Top Characters
word rank frequency n-gram
1 279662 S-
2 235925 M-
3 219930 C-
4 206608 A-
5 198600 B-
Top Character Bigrams
word rank frequency n-gram
1 80950 Ma-
2 55338 co-
3 48148 Ch-
4 47291 Co-
5 44913 re-
Top Character Trigrams
word rank frequency n-gram
1 33027 www-
2 20346 Mar-
3 17611 pro-
4 16198 con-
5 15987 The-
Top Character 4-Grams
word rank frequency n-gram
1 32947 www.-
2 13060 http-
3 11621 non--
4 11120 The-
5 7548 anti-
Top Character 5-Grams
word rank frequency n-gram
1 10280 http:-
2 6329 anti--
3 5188 John-
4 4630 inter-
5 4621 Chris-
339897 msec needed at 2025-02-05 13:35